AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Multimodal Analysis

# Multimodal Analysis

Turkish Llava Med Pipeline V1.5 Mistral 7b
Apache-2.0
Based on Microsoft LLaVA-Med v1.5 (Mistral 7B) architecture, customized for Turkish-language medical visual question answering (VQA)
Text-to-Image Transformers Supports Multiple Languages
T
nezahatkorkmaz
58
2
MATCHA ChartQA V1
ChartQA is a visual question answering model focused on extracting information from charts and answering related questions, with support for Vietnamese.
Text-to-Image Transformers Other
M
TeeA
15
0
DONUT ViChart
ViChart is a visual question answering model based on the transformers library, specializing in Vietnamese chart comprehension tasks.
Text-to-Image Transformers Other
D
TeeA
17
1
Sampel2 Docqa Layoutlmv3 Base
A document Q&A model fine-tuned based on microsoft/layoutlmv2-base-uncased. The specific training dataset is unknown.
Question Answering System Transformers
S
Tejagoud
10
0
Layoutlmv2 Finetuned Sroie Mod
A document understanding model fine-tuned from microsoft/layoutlmv2-base-uncased, suitable for structured document information extraction tasks
Large Language Model Transformers
L
Theivaprakasham
37
1
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase